National Repository of Grey Literature: 3 records found
Text Classification Methods in the Context of Web Pages
Trstenský, Patrik ; Bartík, Vladimír (referee) ; Burget, Radek (advisor)
This work deals with text classification in the context of websites. It examines available classification methods and their accuracy on the plain text of web pages, and it addresses the construction of a dataset for training these methods for a specific domain. The data for the dataset are obtained from publicly available websites that embed RDF documents in their HTML code. The work concludes with the creation of two datasets for two different domains and the use of these datasets for training models and testing their accuracy.
Open Data in the Field of Scientific Research (Otevřená data v oblasti vědeckého výzkumu)
DOKTOR, Ondřej
This thesis focuses on open data in the academic and research sectors and on the openness, sustainability, and reusability of data processed in diverse scientific communities. Since this topic is very broad, the thesis narrows its focus to the author's home institution, the Faculty of Science, University of South Bohemia, Czech Republic, and its data sources with respect to the FAIR data principles. The main questions discussed are: Is it possible to convert the current data foundation of the research groups of the Faculty of Science of the University of South Bohemia into a FAIR form, and how demanding would this transformation be? What data foundations now exist at the institution, and in what form? What needs to be done for these data foundations to be considered FAIR? What prevents the transformation of these data foundations into a FAIR form? How can this transformation be carried out in practice?
Analysis of schema.org utilization
Káva, Ján ; Svátek, Vojtěch (advisor) ; Mynarz, Jindřich (referee)
This bachelor thesis examines the utilization of the schema.org semantic model. The theoretical part analyzes the use of schema.org with the Microdata, RDFa and JSON-LD data standards. It also describes search engine optimization and the possibilities schema.org offers for improving it. The practical part analyzes the utilization of schema.org in the Microdata, RDFa and JSON-LD data formats, based on data extracted from a web crawl in October 2016.
